Your browser doesn't support javascript.
Show: 20 | 50 | 100
Results 1 - 4 de 4
Filter
Add filters

Language
Document Type
Year range
1.
PLoS Comput Biol ; 18(11): e1010649, 2022 11.
Article in English | MEDLINE | ID: covidwho-2140357
2.
Environmental science & technology letters ; 2022.
Article in English | EuropePMC | ID: covidwho-1905152

ABSTRACT

Wastewater surveillance has rapidly emerged as an early warning tool to track COVID-19. However, the early warning measurement of new SARS-CoV-2 variants of concern (VOCs) in wastewaters remains a major challenge. We herein report a rapid analytical strategy for quantitative measurement of VOCs, which couples nested polymerase chain reaction and liquid chromatography–mass spectrometry (nPCR-LC-MS). This method showed a greater selectivity than the current allele-specific quantitative PCR (AS-qPCR) for tracking new VOC and allowed the detection of multiple signature mutations in a single measurement. By measuring the Omicron variant in wastewaters across nine Ontario wastewater treatment plants serving over a three million population, the nPCR-LC-MS method demonstrated a better quantification accuracy than next-generation sequencing (NGS), particularly at the early stage of community spreading of Omicron. This work addresses a major challenge for current SARS-CoV-2 wastewater surveillance by rapidly and accurately measuring VOCs in wastewaters for early warning.

3.
Virus Evol ; 7(2): veab092, 2021 Dec.
Article in English | MEDLINE | ID: covidwho-1623514

ABSTRACT

Phylogenetics has played a pivotal role in the genomic epidemiology of severe acute respiratory syndrome coronavirus 2, such as tracking the emergence and global spread of variants and scientific communication. However, the rapid accumulation of genomic data from around the world-with over two million genomes currently available in the Global Initiative on Sharing All Influenza Data database-is testing the limits of standard phylogenetic methods. Here, we describe a new approach to rapidly analyze and visualize large numbers of SARS-CoV-2 genomes. Using Python, genomes are filtered for problematic sites, incomplete coverage, and excessive divergence from a strict molecular clock. All differences from the reference genome, including indels, are extracted using minimap2 and compactly stored as a set of features for each genome. For each Pango lineage (https://cov-lineages.org), we collapse genomes with identical features into 'variants', generate 100 bootstrap samples of the feature set union to generate weights, and compute the symmetric differences between the weighted feature sets for every pair of variants. The resulting distance matrices are used to generate neighbor-joining trees in RapidNJ that are converted into a majority-rule consensus tree for each lineage. Branches with support values below 50 per cent or mean lengths below 0.5 differences are collapsed, and tip labels on affected branches are mapped to internal nodes as directly sampled ancestral variants. Currently, we process about 2 million genomes in approximately 9 h on 52 cores. The resulting trees are visualized using the JavaScript framework D3.js as 'beadplots', in which variants are represented by horizontal line segments, annotated with beads representing samples by collection date. Variants are linked by vertical edges to represent branches in the consensus tree. These visualizations are published at https://filogeneti.ca/CoVizu. All source code was released under an MIT license at https://github.com/PoonLab/covizu.

4.
Virus Evol ; 7(1): veaa106, 2021 Jan.
Article in English | MEDLINE | ID: covidwho-1045826

ABSTRACT

Many virus-encoded proteins have intrinsically disordered regions that lack a stable, folded three-dimensional structure. These disordered proteins often play important functional roles in virus replication, such as down-regulating host defense mechanisms. With the widespread availability of next-generation sequencing, the number of new virus genomes with predicted open reading frames is rapidly outpacing our capacity for directly characterizing protein structures through crystallography. Hence, computational methods for structural prediction play an important role. A large number of predictors focus on the problem of classifying residues into ordered and disordered regions, and these methods tend to be validated on a diverse training set of proteins from eukaryotes, prokaryotes, and viruses. In this study, we investigate whether some predictors outperform others in the context of virus proteins and compared our findings with data from non-viral proteins. We evaluate the prediction accuracy of 21 methods, many of which are only available as web applications, on a curated set of 126 proteins encoded by viruses. Furthermore, we apply a random forest classifier to these predictor outputs. Based on cross-validation experiments, this ensemble approach confers a substantial improvement in accuracy, e.g., a mean 36 per cent gain in Matthews correlation coefficient. Lastly, we apply the random forest predictor to severe acute respiratory syndrome coronavirus 2 ORF6, an accessory gene that encodes a short (61 AA) and moderately disordered protein that inhibits the host innate immune response. We show that disorder prediction methods perform differently for viral and non-viral proteins, and that an ensemble approach can yield more robust and accurate predictions.

SELECTION OF CITATIONS
SEARCH DETAIL